Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training

Add code
Mar 25, 2026
Viaarxiv icon

LED: A Benchmark for Evaluating Layout Error Detection in Document Analysis

Add code
Mar 18, 2026
Viaarxiv icon

The COTe score: A decomposable framework for evaluating Document Layout Analysis models

Add code
Mar 16, 2026
Viaarxiv icon

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

Add code
Mar 10, 2026
Viaarxiv icon

Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Add code
Mar 11, 2026
Viaarxiv icon

GLM-OCR Technical Report

Add code
Mar 11, 2026
Viaarxiv icon

ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts

Add code
Mar 10, 2026
Viaarxiv icon

MITRA: An AI Assistant for Knowledge Retrieval in Physics Collaborations

Add code
Mar 10, 2026
Viaarxiv icon

Towards Khmer Scene Document Layout Detection

Add code
Feb 28, 2026
Viaarxiv icon

DocDjinn: Controllable Synthetic Document Generation with VLMs and Handwriting Diffusion

Add code
Feb 25, 2026
Viaarxiv icon